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What Is Claimed is: 



Aprogfram conversiou apparatus for converting a gi ven 



:»ource program into a program for a multi- thread processor 
including a plurality of program couuterb and a plurality nf 
thread execution app«ratits , said plurality of thread execution 
^aratus being operable to fetch, decode and execute a plural i ty 
uf instructions of threads simultaneously in accordance with 
said plurality of program coxinters such vhat \z is possihiR 
t/o eAttcuce, after a i:Jire;iri is created, the thread m a controi 
speculative mode wherein a change hayxn^ had an effect on a 
register set can be cancelled later and to execute the thread 
in a data -dependent speculative mode wherein, when, after ^ 
belf thread loads a va Ine from a memory location, a parent thread 
by which the self thread has been created stores a value into 
the same memory location, at least a processing result of the 
self thread after the load is abandoned and the processing is 
re-executed, said mniti -thread processor having an instruction 
set with which it can be executed by a single machine instruction 
or a combination of several machine instructions for a thread 
being executed by any of said thread execution apparatus tn 
create a new thread of the control speculative mode, to end, 
if a designated condition is satisfied, the self thread j^nd 
clear the control sspecnlativ© mode of a thread of the control 
speculative mode created by the self thread, to abandon the 
created thread of the control speculative mode, to give, when 
a thread created by the self thread perfurms load from a memnry 
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location of a designaterl address, an instruction in advance 
to temporarily block the operation, Lu clear the load temporary 
blocking Ins tract inn to the designated memory addross, for the 
thread boing executed by the thread eaecuLion apparatus to create 
a nw thread of the data-dependent speculative mode and to clear 
the data -dependent speculative mode of Lhe chread of the 
daca^depCTdent specuifltive mode created by the self thread, 
said program conversion apparatus cumprising: 

a register al location trial section for trying register 
allocation prior to paralielizatiuii Lu estimate a register 
allocation situation nt variables and intermediate torms of 
an intorraediatc program; 

a fork spot determination section for determining based 
on a result of the register allocaLiuu trial by said register 
allocation trial section whether or not a conditional branch 
portion of the intermediate program shuuld be converted into 
a parallel code for which a thread creation instruction is used 
and determining a parallelization execuLion method with the 
parallel code; 

an instruction reordering sec Lion for converting the 
conditional branch portion in the intermediate program into 
a parallel code for which the thread creation instruction is 
used based on a result of the determination by said fork spot 
determination section and referring to the result of the register 
allocation triaJ rn insert an instruction for assuring a 
data -dependence relationship between threads through a memory 
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into positions before and after the thread creation instruction 
and reorder the instructions before aud alter the thread creation 
instruction so that, thread creation may be performed in an early 
stage; and 

a register allocation section for performing definite 
register allocation so that , regardiuj? wht»ther or not a physical 
register is allocatRri to the parallelized and reordered 
instruction sequence, zhe same aiiocaLiuii result as "hat upon 
the register allocatinn ztisll may be obtained. 

2. A program conversion apparatus as claimed in claiTn 
1, wherein said forte spot determination section investigates 
a data dependence relationship Ihruugh a memory from a basic 
block in the IntermRdi ;^te program which is a processing object 
at present to each of basic blocks of branching destinations 
of a conditional branching instruction positioned at the tail 
end of the basic block, coimts, fur each of the branching 
destinations, the instruction step number, from the top of the 
branching destination basic block, of Che instruction at the 
top one of memory reference instructions in the branching 
destination basic block which cause the data dependence, and 
selects that one of the branching destination basic blocks whose 
instruction step number is greater as a new thread to be executed 
parallelly . 

3. A program conversion apparatus as claimed in claim 
2, wherein said fork apnr. determination section determines the 
position of a data- dependent instruction through a memory in 



each branching destination basic block using a value obtained 
by accumulating estimated execution cycle numbers of 
instructions in place of the instruction step number. 

1, A program conversion appaiatu^ as claimed in claim 
1, wherein, upon converflinn from a source program into a target 
program first, address coordination information for 
establishing conrrii nation between the basic blocks of the 
intermediate program in said program cunversion apparatus and 
machine language addresses of the target program ro be outputted 
is outpurrcd together wizh zli^ vax#<ei program, and i carge* 
program execution apparatus reads in the target program and 
the address coordination inforiuaLiun and executes the target 
program and th fin outputs profile information including branch 
profile information between batiic blocks upon the exeoutinn 
of the target program ;^nd data dependence information occurring 
through a memory between the basic blocks, whereafter, when 
said program conversion apparatus parallelizes the source 
program to convert the source pru^ram into a target program, 
said fork spot determination section refers to the profile 
information to preferentially select a branching destination 
basic block to whinh control flows in a high probability at 
a conditional branch and another branching destination basin 
block with whirh data dependence occurs in a low probability 
at a conditional branch as autjw thread to be executedparal 1 el 1 y . 

5. A program conversion apparatus as claimed in claim 
>l, wherein said fork spot detexmiiiation section produces an 



instruction to oahsp. a conditional branching destination basic 
block selected as an execution sLart point of the new thread 
to be executed paral 1 eiiy to teaporarily block, ^cn the number 
of spots of different meoory addresses whicli c^usie date 
dependence is shwI ler than a predetermined number based on a 
result of an analysis of data dependence throu^ a raemnry in 
the intermediate program and a data dependence occurrence 
probability obtaxned from the profile information, Inad 
operation of clifi new thread from cho memory addresses, but 
investigates, when the number uf spots of different memory 
addresses which nause data dependence is equal to or greater 
than the predetermined ntimber , whether ornox the data dependence 
occurrence probability is lower than a predetermined 
probability and produces, if the probability is lower, an 
instruction to create a new thread in the data -dependent 
speculative mode but controls, if the probability is equal to 
or higher than the predetermined probability, so as to stop 
the parallclizQtion conversion at Lhe spot. 

6. A program conversion apparatus as claimed in claim 
4, wherein said fork spot determination section invG<?ti gates 
a data dependence relationship through a memory from the basic 
block in the intermediate programcurrently which is a prncesfsing 
object at present to each of the branching destination basic 
blocks of the conditional brauchiutt instruction positioneri at 
the tail end of the basic block and synthesizes the investigated 
data dependence relationship and the conditional branching 



probability obtained from thp protiie information, and if a 
result of the synthesis reveals that the branching probabilities 
rexarding the branching deflti nation basic blocks at the 
co nd itional branch do not have a difference greater tlian a 
predeteriuined amount and data dependence occurrence timings 
through a memory do not have a difference greater than a 
predetermined amount, said fnrk .spot determination section 
determines ao as not ro parallelize the couditiuxial branching 
porliua. 

7, A program conversion apparatus, coraprisiiifc; : 

a syntax analysis section for analyzing the syntax of 

a source program to produce an intermediate pruKriiiu; 

d parallelization section for performing optimization 

processing including parallelization for iht; intermediate 

program; and 

a code generation section for producing a target program 
including an instruction code for a target processor apparatus 
from the intermediate program optimized by said parallelization 
section; 

said parallelization section including an intermediate 
program inputting section for reading in the intermediate 
program and analyzing a control flow and a data flow , a register 
allocation section for trying to perform register allocation 
prior to parallelization to estimate a register allocation 
situacion of variables anrt intermediate terms of the 
intermediate program and executing allocatiou of registers. 
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a fork spot determinating section for determining, baeed on 
a result of the trial of the register allocaLiuu, a spot of 
d conditional branch portion of the intermediate program to 
be converted into a parallel code for which a thread creation 
instruction is used, an instruction reordering section for 
performing reordering of instructions before cuid after i:lie 
parallelization spot from information of the parallelization 
spot determined by said fork spot deterraiiialiuii section, rhe 
data flow and so forth, ^na an intermediate program outputting 
section for outputting the instnicLiou aeuuence for which the 
conversion including the naraiielization has been completed 
in a format of t;hc intermediate program again. 

'3. Aerogram conversion apparatus as claimed m claim 
7, wherein said parallelization section includes a profile 
information input ting sect i on for receivingprofilo information 
outputted from the target processor apparaLuii as a result of 
execution of the target program and converting the profile 
information into information of an internal foriuaL, and said 
fork spot determination section determines, based on the result 
of the register allocation trial and the profile information, 
a spot of a conditional branch portion of the intermediate program 
to be converted into a parallel code iu which the thread 
(generation code is used and determines a parallelization 
execution method by the parallel code. 

9. A program conversion apparatus as claimed in claim 
7, wherein said instruction reorderiuK seccion converts 
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conditional ftranchi Tig portions in the intermediate program into 
a parallel code in which the thread cx-ealiun instruction is 
used baaed on a result of r.h« determination by said fork spot 
determination section, refers to the reaull uf Lhe register 
allocation trial to insert an instruction for assuring a data 
dependence relationship between threads Lhiuugh a memory into 
positions before and nft.Piv tue thread creation instruction and 
reorders the instructions before and after the thread creation 
instruction so chat thre«ri creation may be performed in an early 
stage . 

10. A program conversion apparatus as claimed in claim 
7, wherein said register allocation section performs definite 
register allocation .sn that , regarding whether or not a physical 
register is allocated to the parallelized and reordered 
instruction sequence, the same allocation result as that upon 
the register allocation trial may be oblaiiied. 

11. A program conversion apparatus for performing 
optimization processing including parallelization of an 
intermediate program obtained by a syntax analysis of a source 
program performed by a syntax analysis section so that the 
intermediate program may be suitable for a target proccseor 
apparatus , comprising : 

register allocation trial means for trying allocation 
of registers of the target proceasur apparatus on the 
intermediate program and obtaining register allocation 
information prior to actual allocation; 
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means for calculating a distance nf data dependence 
generated through a memory in the target processor apparatus 
for the intermediate program: 

means for determining a fork designation taking the 
3 distance of data dependence throu^ a meronry on the intermediate 
program into consideration and replacing a conditional branch 
with a thread creation instruction; and 

means for referring zo a result of the register allocation 
trial zo reorder the instnictioTis before and after rhe thread 
10 creation instruction on the intermediate program. 

12. A program conversion appflrarns according to claim 
11, wherein said means for replacing a conditional branch with 
a thread creation instnictinn includes: 

means for calculating a miniratim value of data depeadence 
15 of intermediate terms and vari ahl e.s for each of the two branching 
destinations of the conditional branch; and 

means for comparing the two minimum va lues of the distance 
of data dependence determined for the two branches of the 
conditional branch, determining, when the two minimum values 
20 have a difference greater than a predetermined value, the 
branching direction of the branch which exhibits the higher 
minimum value of the distance of data dependence as a fork 
destination and selecting the conditional branch spot as a fork 
spot, but determining, when the twominimumvalues of the distance 
25 of data dependence do not h^ivp a difference equal to or greater 
than the predetermined value, that branching destination which 



has been a branching destination in thf» original intermediate 
program as a fork destination and selecting the conditioual 
branch spot ab a fork spot candidate. 

13 . A program conversion apparatus as claimed in claim 
11, further cumprising: 

means for receiving profile information outputted from 
a prucessor apparatus which executes rhe target program 
outputted from said program conversion apparatus and 
calcuiatiu^i <^ uontliT;ional branching probabi 'nry and a aata 
dependence occurrence frequency from the profile information; 
and 

means for determining a fork destination and a data 
dependence assurance system from the distanrp of data dependence , 
the conditional branch probability and the data dependence 
occurrence frequency , and the number of spots of different memory 
addresses which cause data dependence and replacing the 
condxLional branch with the thread creation instruct ion . 

14 . A program conversion apparatus as claimed in claim 
7, wherein the target processor apparatus is a miiti-thread 
processor which includes a plurality of program coimters and 
a plural! Ly of thread execution apparatus, said pUiraiity of 
thread execution apparatus being operable to fetch, decode and 
eiecuLe a plurality of instructions nf thrf»fld$ simultaneously 
in accordance with said plurality of program counters auch that 
it ia yussible to execute, after a thread is rreated, the thread 
in a control speculative rnode therein a change having had an 
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effect on a register set can be canceled later and to execute 
the thread in a data -dependent speculative mode wherein, when , 
after a self thrend loads a value from a memory location, a 
parent thread by which the self thrtiad has been created stnrejs 
a value into the same mmory location , at least a processing 
result of the self thread after the load ib aband<med and the 
processing is re-executfid , said multi-thread processor having 
an instruction set with which xx. cdu be executed by a single 
machine instnictinn or a combination of several machine 
instructions for a thread being eAecuLed by any of said thrRad 
eAecubion apparatus to create a new thread of the control 
speculative mode, to end, if a designated cundition is satisfied , 
Lhe self thread and dear the control speculative mode of a 
thread of the control speculative mode ureatedby the self thread , 
Lo abandon the created thread of the control speculative mode, 
to give, when a thread created by the aelf thread performs load 
from a memory location nf a designated address, an instruction 
in advance to temporarily block the operation, to clear the 
load temporary blocking instruction to the designated memory 
address, for the thread being executed by the thread execution 
apparatus to create a new thread of the data-dependent 
speculative mode and to clear the da La- dependent speculative 
mode of the thread of the data-dependent speculative mode created 
by the self thread. 

15. A program conversion method for performing an 
optimization process including yarallelization for Pin 
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intermediate program nnr.piitted as a result of a syntax analysis 
on a program conversion apparatus which compileb a source program 
and outputs a target program f nr a target processing apparatus 
of the multi- thread type, compriaing: 

a register allocatioTi trial step of trying register 
allocation prior to parallelization to eatiumle a register 
allocation situation of variahl.es and intermediate terms of 
the intermediate program; 

a fork spot determination srep of determining based on 
a result of the register allocation :rlai -vhelher or not a 
conditional branch porr.ion of the intermediate program should 
be converted into a parallel code for whicii ct thread creation 
instruction is used or pftrfnrming determination of whether or 
not the conditional branch portion should be coiiverLed into 
a parallel code and. when surh conversion should be performed, 
determination of a parallelization execution uielhod; 

an instruction reordering step of converting the 
conditional branch portion in the intermediate prujbcram into 
a parallel code for which the thread creation instruction is 
used based on a result of the determination by Lhc fork spot 
determination step and r«t'<3Tring to the result of the register 
allocation trial to insert an instrucLiuu for assuring a 
data-dependence relation ship between threads through a memory 
into positions before and after the thread crealiua ius> cruet ion 
and reorder the instrucr.i nns before and after the thread creation 
instruction so that thread creation may be perfuriued in an early 
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» Cage ; and 

a register allocation 3tcp of performing definite 
register allocation so tliat the same allocation result as that 
upon the ragister allocation trial may be obtained for the 
paxallelized and reordered instruction sequence. 

16. A. program conversion method as claimed in claim 15 , 
wherein the fork spot determination step means caimilatps ?i 
minimum value of data dependence of intermediate terms and 
variables for each of the two branching desstlnations nf rhe 
conditional branch, compares ~hc two minimum values ox the 
distance of data dependence determined for the two nrannhft.q 
of the conditional branch, determines, when the two minimum 
values have a difference greater than a prRdeterTmned value, 
the branching direction of the branch which exhibits the higher 
minimum value of the distance of data dependence as ;j fork 
destination and selects the conditional branch spot as a fork 
spot . but determines , when the two minimum values of the di stanne 
of data dependence do not have a difference equal to or greater 
than the predetermined value, that branching destination which 
has been a branching destination in the original intermediate 
program as a fork destination and selects the conditional hranrh 
spot as a fork spot candidate. 

17. A program conversion method as claimed in claim 15, 
wherein the fork spot determination step investigates a data 
dependence relationship through a memory from a basic block 
having no branch and no confluence from within the intermediate 
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program wbicti is a processing object at present to each of basic 
blocks of branching destinations of a condiLiuual branching 
iastractionposltionedatthfftai f endof thabasicblock, counts, 
for each of the branching destinations, the iu» traction step 
number, froifl the top of thfi branching destination basic block, 
of the instruction at the top one of memory refereuct; iabtructions 
in the branching destinatinn hasic block which cause the data 
dependence, and sclccta t;hat one of the branching desliiiiition 
basic blocks whose instmcrion step number is greater as a new 
thread to be executed paralielly. 

18. A program conversion method as claimed in claim 17. 
wherein the fork spot determination step deiex-miueb the position 
of a data-dependent instruction through a memory in each 
branching destination basic block usIuk a value obtained by 
accumulating est imatftcl execution cycle nxambers of instructions 
in place of the instruction step aujuber. 

19. A program rnnversion method as claimed in claim 15, 
wherein, upon conversion from a source program iiiLu a target 
program first by said program conversion apparatus, address 
coordination information for establishing cuurdinati on between 
the basic blocks of the intermediate program and machine language 
addresses of the target program to be outputted is outputted 
together with the target program, and a processor apparatus 
which is to execute the object program reads in the target program 
and the addref^s morrlination information and executes the target 
program and then outputs profile iurormalion including branch 
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profile information betwRRn basic blocks upon the execution 
of the target program and data dependence iarormaLion occurring 
throusb a me0ory between the basic blocks, whereafter, when 
said program conversion apparatus parallelizes the source 
prujtjrram to convert the sonrcfi program into a target program, 
the fork spot determination step refers to Lhe profile 
information to preferential ly select a branching destination 
basic block uo which conrroi flows m a aiKii urobaDiiity at 
d conditional branch and ;^nother oranchmg destination basic 
olock with which data dependence occui*3 in a ^uw prubabiiity 
at a conditional branch as a tirw thread to be executed parallel ly . 

20. A program conversion method as claimed in claim 19. 
wherein the fork spot detRrmi nation step produces an instruction 
to cause a conditional branching destiuation basic block 
selected as an execution start point of the new thread to be 
executed parallclly to temporarily block, when Lhe number of 
spots of different memory addresses which cause date dependence 
is smaller than a predetermined number based un a result of 
an analysis of data dependence through a memory in the 
intermediate program and a data depeudeuce occurrence 
probability obtained from the profile information, load 
operation of the new thread from the memory addresses, but 
investigates, when th« number of spots of different memory 
addresses which- cause data dependence is equal to or greater 
than the predetermined nnmher , whether or not the data dependence 
occurrence probability is lower than predetermined 
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probability and produces, if the probability lower, an 
instruction to create a new thread in the data -dependent 
speculative mode but controls, if the probability is equal tn 
or hi^er than the predetermined probability, so aa to atop 
the parallel ization conversion at the spot . 

21. A program conversion apparatus as claimed in claim 
19, vvkereiu the fork spot determination step investigates a 
data dependence relationship chrough a mcniory from whe basic 
block iu the intermediate program currently which is aprnr.essi ng 
object at present zo each of the branching destination basic 
blucks of bhe conditional branching instruction posi ti nnRri at 
the tail end of the basic block and synthesizes the investigated 
data dependence relationship and the conditional branching 
probability obtained from the profile information, and if a 
resulL uf Lhe synthesis reveals that the branching probabi 1 i ti as 
regarding the branching destination basic blocks at the 
coiidlLiunal branch do not have a difference greater than a 
predetermined amount and data dependence occurrence timings 
through a memory do not have a difference greater than a 
predetermined amount, said fork spot determination section 
deLermiaes so as not to parallelize the conditional hranrhing 
portion. 

22. A program conversion method as claimed in claim in, 
wherein the fork spot determination step includes the steps 

of: 

discriminating whether or not the conditional branching 
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iusLruction corresponds to a return branch nt a loop structure 
in the intermediate program; 

dtstermininK, when the conditional branch ing instruction 
corrasponds to a loop return branch, the direction of the return 
branch, which is a loop continuing direction, as a fork 
destination and selecting the conditional branch spot as a fork 
spot ; 

but calculating, when the conditional branching 
iustrucLioxi IS aox a loop return branch, a mimmiim vaiue oi 
the distance of data dependence of intermediate terms { variables 
fur each of the two branch dest inat ions of the conditional branch : 

comparing the two minimum values of the distance of data 
dependence determined with regard tn rhp two branches of the 
conditional branch with each other to discriminate whether or 
not Lhe Lwu minimum values have a difference graarer than a 
predetermined value ; and 

determining, when the two minimum valufts of the distance 
of data dependence have a difference greater than the 
predetermined value, the branch which exhibits a larger one 
of the minimum values of the distance of data dependence as 
a fork destination and selecting the conditional branching spot 
as a fork spot; but 

determining, when the two minimnm values of the distance 
of data dependence do not have a difference greater than the 
predetermined value, that one of the branrhes which has been 
a branch destination in the original intermediate program as 
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a fork destination and selecting tlie conditional branrhiTig spot 
as a fork candidate. 

23. A program conversion method as claimed in claim 22, 
^xdierexn the distance of data dependence is the number of steps 
ill Lhe intermediate program which represents at whur. dint^Tice 
from the top of a basic program of the branching destination 
the lueuiory reference instruction is positioned with regard tn 
each of the intermediate terms and variables which are defined 
in the basic block which is a processing object prRaftnr. and 
may possibly be referred to in the branching destination and 
besides are estimated to be arranged on the mftmory. 

24 . A program conversion method as claimed in claim 22 , 
wherein, upon the determination nf rhR distance of data 
dependence , the number of cycles estimated to be required when 
pertaining instructions are executed on a processor of an object 
architecture . 

25. A program conversion method as claimed in claim 15, 
wherein the instruction reordering step includes : 

a first step of investigating an allocation situation 
of registers with regard to whether each of intermediate terms 
and variables in the intermediate program is cnorriinated with 
a register or a memory; 

a second step of replacing a branching instruction 
positioned at the tail end of a basic instruction which is a 
processing object at present with a mntrof spectilation mode 
PORK instruction while the fork destination which is an operand 
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of the control speculation morie WKK instruction is determinod 
a8 the fork destination selected by the fork spot deLermination 
step; 

a third step of moving a branching condition expresdiuu 
5 p02»iLioned immediately prior to the control speculation mode 
FORK instruction in the intermediate instruction to the uuaiLion 
immediately next to the nnntroi speculation mode PORK 
instruction and inserting to zhe tail end of the baaiu block, 
which is zhe destination of -he movement ox the branching 

iO condition expression, on instruction sequence for endin^f, when 
the branching condition is sati 53f i ed , the self thread andplacing 
a child thread into a settlement mode which is a aou-cuuLx'ul 
speculation mode, hut abflnrinning, when the branching condition 
is not satisfied, the child thread and keeping the self thread 

15 to continue execution of a siirreedJng instruction train; 

a fourth step of moving each of instruction slaLemeuLs 
which are on the upstream .side with respect to the control 
speculation mode FORK instruction in the basic block being a 
processing object at present and are to be substituted into 

20 intermediate terms and variables coordinated with a memory Lu 
a position on the downstream sirie with respect to the control 
speculation PORK instruction and inserting a BLOCK setting 
instruction to the position immediately prior to the control 
speculation mode FORK instruction while inserting a BLOCK clear 

25 instruction to the position immediately next to the movement 
destination of the substitute statement; and 
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a fifth 3tep of issning an instruction to allocate the 
registers in accordance with the register allucalion situation 
assumed by the fork conver^^inn processing in the second step. 

26 . A program converaion method claimed in claim 15 , 
wherein the fork spot determination step includes: 

a first step of discriminating whether ur not the 
coiidi(;iunal branching instmrtion corresponds to a return 
branch of a loop structure in the intermediate program; 

a second srep of nrovTsionaiiy determining, when t;hc 
conditional branching instruction correbpunds zo a return 
branch of a loop structurR, the direction of the return branch 
as a fork destination; 

d third step of calculating, based on the received profile 
information, a probability with which a taken side of the 
conditional branching instruction is selected and another 
probability with which a fall-through side of the conditional 
branching instruction is selected; 

a fourth step of discriminating whether or not the two 
calculated probabilities of the branches have a difference 
greater than a predetermined value; 

a fifth step of provisionally determining, if the 
difference between the two probabilities of the branches is 
greater than the predeterminRd value , the branch which exhibits 
the higher probability as a fork destination; 

a sixth stepof cal mil ;i ting a minimum value of the distance 
of data dependence for each of the two branching destinations 
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of the conditional branch; 

a seventh step of comparing the two minimum values of 
the distance of data dependence determined with rftgrprd to the 
two branches of the conditional branch with each other to 
5 dibcrifflinate whether or not the two mininnim values have a 
difference greater than a predetermined value; 

an eighth step of determining, when the two minimum values 
of the distance of data dependence have a difference greater 
Lhaii uhe predetermined value or no dai^ft fiftpenhence is found, 
10 that one of che branches which exhibits a higher one of the 
minimum values of the distance of rtepeTidence as a fork 

destination; 

a ninth step of calculating a minininm va tiie of the distance 
of data dependence for the fork destination provisionally 
Q ,15 detemiued in the second step or the fifth step and di scri mi nat ing 

^ whether or not the minimum value of the distance of data 

:^ dependence of the provisionally determined fork destination 

f=^ is equal to or higher than a predetermined value; 

a tenth step of settling, if the minimum value of the 
20 distance of data dependence of the provisionally determined 
Turk destination is equal to or highfsr than a predetermined 
value or no data dependence through a memory is found, the fork 
destination provisionally determined in the second step or the 
fifth step as a- formal fork destination; 
25 an eleventh step of excepting, when it i s determined that 

the minimum value of distance of data dependence is lower than 
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the predtttenniued value, the basic block from a fork spot; 

a twelfth step of calculating a data dependence occurrence 
frequency fron the received profile information; 

a thirteenth step of discriminating whether or not the 
data dependence occurrence frequency is equal to or higher thfln 
a fixed level, counting, when the data dependence occurrence 
frequency is lower than the fixed level, the number nf 
intermediate terms/ variables on the memory which may possibly 
cause data dependence from the basic block of the fnrk smirce 
to the basic block of the fork destination xn the intermediate 
program, discriminating whether or not the nmint vaine 
representing a data -dependent spot number is equal to or higher 
than a fixed level, determining that a fork arrnrding the DSP 
system is used if the data-dependent spot number is equal to 
or higher than the fixed level but determining that a fork 
according to the BLOCK system is used if the data-dependent 
spoL number is lower than the fixed level, and providing 
information of the fork to the FORK instruction in the 
intermediate program; and 

a fourteenth step of countings when the data dependence 
occurrence frequency is equal to or higher thfln the fi Yftd lev©! , 
the number of data -dependent variables on the memory and 
determining that a fork according to the RT.OCK system is used 
if the counted number is smaller than a fixed level but removing 
the basic block from a fork candiriatfi if the counted ntimber 
is oqual to or greater than the fixed level. 
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27 . A program conversion method as claimed in claim 15. 
wherein the instruction reordering step includes: 

a first step of investigating an ^» I location situation 
of registers with regard to whether each of intermediate terms 
and variables in the intermediste program is coordinated with 
a register or a memory; 

a second step of replacing a branching instruction 
positioned at the tail end of a basic instruction which in a 
processing object at present wi-n a control speculation mode 
PORK instruction while the fork destination which is aa operand 
of the control speculatinn mode FORK instruction is determined 
as the fork destination selected by the fork spot deteriuiualion 
step; 

a third step of moving a branching condition expression 
positioned immediately prior to the control speculation FORK 
instruction in the intermediate instruction to the position 
immediately next tn the control speculation FORK instruction 
and inserting to the tail end of the basic block, which is Lhe 
destination of the movement of the branching condition 
expression, an instruction sequence for endiuff, when Lhe 
branching condition is satisfied, the self thread and placing 
a child thread into a settlement mode which is a lion- control 
speculation mode , but abandoning, when the branching condition 
is not satisfied, the child thread and keeping Lhe aelf Lhread 
to continue execution nr a succeeding instruction train; 

a forth 3tep of checking whether a fork daLu dssurance 




system of Llie fork spot determined Dy the fork spot determination 
step is the BLOCK system or the DSP system; 

a rif th step of moving . when the fork data assurance system 
is the BLOCK systwa, a memory store statement prior to the fork 
5 to a positiou aXLer Lhe fork, inserting necessary BLOCK setting 
and BLOCK clear instructions, inspecting, upon the movement, 
'd data dependence relationship and moving only those 
instructions a change of whose instruction execution order does 
aoi lihauKe i result of arithmetic operation; 

10 a sixth stop of modifying, when the fork dara assurance 

system is Lhe DSP system, the yORK instruction produced by 
replacement in the second step so that a substitute statement 
into an intermediate term coordinated with a memnry is mnveci 
to a position next to the PORK instruction to perform the fork 

15 in a da La -dependent speculation mode; and 

a seventh step of issuing an instruction to allocate the 
registers in accordance with the register allocation situation 
assumed by the fork conversion process in the second step. 
28. A recording medium on which a program for causing 

20 a computer to perform an optimization process including 
paralltrlization for an intermediate program outpurted as a 
result of a syntax analysis on a compiler which compiles a source 
program and produces and outputs a target program for a 
multi- thread processor apparatus is recorded, the optimization 

25 process including: 

a register allocation trial process of trying register 
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allocation prior to paralle 1 i 7fltion to estimate a register 
allocation situation of variables and intermediate oerius of 
the iiiLtsrmisdiate proffram; 

a fork spot dctcrsdnation procesa of deterainluK bat&ed 
ou a result of the register allonation trial whether or not 
a conditional branch portion of the intemediate program s^huuld 
be cuu verted into a parallel code for which a thread creation 
instruction la used or performing determination of whether ur 
not lae conditional branch portion snould be converted into 
a parallel code and, when 3uch conversion should be perfunned, 
deLermination of a parallel i 7«ti on execution method; 

an instruction reordering process of coaverliuK Che 
coudiliuxial branch portion in the intermediate program into 
a parallol code for which the thread creation instrucliuu is 
ujjed based on a result of the determination by the fork spot 
determination step and referring to the result of Ihe register 
allocation trial to insert an instruction for assuring a 
data -dependent relationship between threads through a memory 
iuLo positions before and after the thread creation instruction 
and reorder the ins t rue t i ona before and after the thread c r ea t i on 
instruction so that thread r-reation may be performed in an early 
stage ; and 

a register allocation process of performing definite 
register allocation so that the same allocation result as lhai 
upon the register allocation trial with regard to whether a 
physical register ia allocated aiay be obtained fur uhe 
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parallelized and reordered instruction sequence. 

29. A recording medium as claimed in claim 28, wherein 
the fork spuL determination process investigates a data 
dependence relationship through a memory from a basic block 
5 in the intermediate prugraffi ii4iich is a processing object at 
present to each of basic blocks of branching destinations of 
a conditional branching instruction positioned at the rai 1 ftnn 
of the basic block, counts, for each of the branching destinations 
the instrucLiuii icep number, rrom the ^op of 'ihe nranrning 
10 destination basic block, of che inBtniction at t;he zop one of 
memory rererence instructions in the branching destinatron 
basic block which cause the data dependence, and selects that 
cue uf the branching destination basic blocks whnsft instruction 
step number is greater as a now thread to be executed parallelly . 
13 30. A medium as claimed in claim 17, wherein the fork 

spot determination process determines the position of a 
data -dependent instruction through a memory in eflch branching 
destination basic block using a value obtained by accximulating 
esLimaLed execution cycle numbers of instructions in place of 
20 the instruction process number. 

31. A medium as claimed in claim 28, whfsrRin, upon 
conversion from a source program into a target program first 
by said compiler, address coordination information for 
establishing coordination between the basic blocks of the 
25 inlermediate program and machine language adrtrpsses of the 
target program to be outputtcd is outputted together with the 
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target program, and a processor apparatiis which is to execute 
th0 object program roads in the target program and the address 
coordination Information and executes th^ target program and 
then outputs profile information including branch profile 
information between basic blocks upon the exemt i on of the target 
program and data dependence information occurring through a 
memory between the basic blocks , whereafter . when said compiler 
parallelizes the source program to convert the source program 
into a target program , che fork spnr rleterroinat ion process refers 
to the profile information to preferentially select a branchiiig 
destination basic block r.n which control flows in a high 
probability at a conditional branch and another branching 
destination basic block with which data dependence occurs in 
a low probability at a conditional branch as a new thread Lu 
be executed parallel! y. 

32. A medium as claimed in claim 01, wherein the fork 
spot determination process produces ;»ti instruction to cause 
a conditional branching destination basic block selected as 
an execution start point of the new thread to be executed 
parallclly to temporarily block, when the number of spots of 
different memory addresses which cause date dependence is 
smaller than a predetermined number based on a result of an 
analysis of data dependenre through a memory in the intermediate 
program and a data dependence occurrence probability obtained 
from the prnfi le in format ion, load operation of the new thread 
from the memory addresses, but investigates, when Lhe number 
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uf apola of different memory addresses which cause fiata 
dependence is equal to or greater than the predetermined number , 
whether ur uut the data dependence occurrence probability 151 
lower than a predetermined probability and producce, if the 
probabiliL.y is lower, an instruction to create a new tbreftri 
in the data -dependent speculative mode but controls, if the 
probability is equal 1:0 or higher than the prerietRrmiTiecl 
probability, so as zo stop the parallclization conversion at 
the :>uuu . 

33. A compiler as claimed in claim 31, wherein uhe fork 
spuL determination process investigates a data dftpRnrience 
relationship through a memory from the basic block in the 
intermediate program currently which is 3 prnressing object 
at present to each of the branching destination basic blocks 
or Lhe conditional branching instruction positioned at the tai 1. 
end of the basic block and synthesizes the investigated data 
dependence relationship and the cnnriitionai branching 
probability obtained from the profile information, and if a 
result of the synthesis reveals that the hranrhingprobabilities 
regarding the branching destination basic blocks at the 
conditional branch do not have a difference greater than a 
predetermined omotint and data dependence occurrence timings 
through a memory do not have a dif ferRnre greater than a 
predetermined amount , said fork spot determination section 
determines so as not to parallelize the cnnriitionai branching 
portion . 
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